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INTRODUCTION 

The first crude estimates for the magnitude of the three lightest quark masses 
appeared more than 20 years ago:0 

m u ~ 4MeV m d ~ 6MeV m s ~ 135 MeV,! 

m u ~ 4.2 MeV m d ~ 7.5 MeV m s ~ 150MeV.i 

Many papers dealing with the pattern of quark masses have appeared since then, but 
the numbers have barely changed. In the first part of this lecture, I discuss the ar- 
guments that lead to the above pattern, emphasizing qualitative aspects concerning 
the breaking of isospin and eightfold way symmetries. The second part deals with the 
chiral perturbation theory results for the masses of the pseudoscalar octet, which lead 
to very stringent constraints for the relative size of the light quark masses, i.e. for the 
ratios m u /md and m s jm d . Finally, I briefly comment on their absolute magnitude and 
then draw some conclusions. 



MASS SPECTRUM OF THE PSEUDOSCALARS 

The best determinations of the relative size of m u , m d and m s rely on the fact that 
these masses happen to be small, so that the properties of the theory may be analysed 
by treating the quark mass term in the Hamiltonian of QCD as a perturbation. The 
Hamiltonian is split into two pieces: 



Hqpd = H Q + Hi , 



where H describes the three lightest quarks as massless and H 1 is the corresponding 
mass term, 

Hi = J d 3 x{m u uu + m d dd + m s ss} . 

Hq is invariant under the group 811(3)^x811(3)^ of independent flavour rotations of 
the right- and lefthanded quark fields. The symmetry is broken spontaneously: the 
eigenstate of Hq with the lowest eigenvalue, |0), is invariant only under the subgroup 
SU(3) V C SU(3) i{ x SU(3)£,. Accordingly, the spectrum of Hq contains eight Goldstone 
bosons, ir"* 1 , ir°, K ± , K°, K°, r\. The remaining levels form degenerate multiplets of 
SU(3) V of non-zero mass. 

The perturbation H\ breaks the symmetry, because it connects the right- and left- 
handed components: uu = u R u L +h.c. In so far as the quark masses m u , m d , m s are small, 
the entire term Hi amounts to a small perturbation, so that the group 811(3)^x811(3)^ 
still represents an approximate symmetry of the full Hamiltonian. The perturbation 
splits the SU(3) multiplets, in particular also the Goldstone boson octet. To first 
order in the perturbation, the square of the pion mass is given by the expectation 
value of the perturbation, M 2 + = (n + \m u uu + m d dd + m s ss\ii + ) and is therefore 
linear in the quark masses. Since the eigenstates entering here are those of H , the 
matrix elements respect SU(3) symmetry. In particular, isospin conservation requires 
(n + \uu\n + ) = dd\n + ) = B . Moreover, since the subgroup SU(2) fl xSU(2) L becomes 
an exact symmetry for m u , m d ^0, m s ^ 0, the pion mass must disappear in this limit, 
so that the matrix element (n + \ss\iT + } vanishes. The expansion of M 2 + in powers of 
m u ,md,m s thus starts with 

M 2 + = (m u + m d )B{l + 0(m u ,m d ,m s )} . (1) 

The operation d— >s takes the 7r + into the K + , and the K° may be reached from there 
with u—>d. Hence the corresponding lowest order mass formulae read 

M 2 K+ = (m u + m s )B + . . . , M 2 R0 = (m d + m s )B + . . . (2) 

In the ratios M 2 + : M R+ : M^ , the constant B drops out, so that the quark mass 
ratios must approximately be given by 

^ Ml + - M|p + M 2 K+ ^ Q g6 m. ^ M| + M% + - M 2 + ^ ^ 
m d Ml + + M 2 K0 - M 2 K+ ' ' m d M 2 K , - M 2 K+ + M 2 + 

to be compared with m^/m e ~ 200. These numbers represent rough estimates. The 
corrections generated by the higher order terms in the mass formulae as well as those 
due to the electromagnetic interaction are treated below - they do not significantly 
modify the above ratios. 



APPROXIMATE FLAVOUR SYMMETRIES 

First, however, I wish to discuss some qualitative aspects of the above relations. 
For this purpose, I need a crude estimate for the absolute magnitude of the light 
quark masses, which may be obtained with the following simple argument. The mass 
differences between m u , m d and m s are responsible for the splittings observed within 
the multiplets of SU(3). The observed multiplet pattern shows that, replacing a u- 
or a d-quark by an s-quark, the mass of the bound state increases by about 100 or 
200 MeV. Applying the rule of thumb, we infer that the mass differences m s — m u and 



m s — rrid are of this order of magnitude. Since m u and rrid are small compared to m s , 
the mass of the strange quark must be of this order, say m s ~ 150 MeV. With the 
above ratios, this gives rrid — ~^ m s — 7.5 MeV and m u ~ -m^ ~ 5 MeV. I emphasize 
that these estimates only concern the order of magnitude and I will discuss our present 
knowledge at a quantitative level later on. 

The first conclusion to draw is that m u and rrid are surprisingly small. In particular, 
the mass of the proton is large compared to the sum of the masses of the quarks it 
consists of. Indeed, M p does not tend to zero for m u , m^— >0. The amount by which 
it decreases is known from nN scattering:! a = (N\ m u uu + rriddd \ N) = 45 ± 8 MeV. 
This shows that the masses occurring in the Lagrangian of QCD are quite different 
from those used in the various bound state models, m const it U ent — %M P ~ 300 MeV. 

An equally striking aspect of the above pattern is that the three masses are very 
different. In particular, the value for m u /rrid shows that the masses of the u— and 
rf-quarks are quite different. This appears to be in conflict with the oldest and best 
established internal symmetry of particle physics, isospin. Since u and d form an /= | 
multiplet, isospin is a symmetry of the QCD Hamiltonian only if m u = rrid. 

The resolution of the paradox is that m u , rrid are very small. Disregarding the 
e.m. interaction, the strength of isospin breaking is determined by the magnitude of 
\m u —md\, not by the relative size m u /rrid. The fact that rrid is larger than m u by a few 
MeV implies, for instance, that the neutron is heavier than the proton by a few MeV. 
Compared with the mass of the proton, this amounts to a fraction of a per cent. In 
the case of the kaons, the relative mass splitting (M^ c0 —M] c+ )/M] c+ is more important, 
because the denominator is smaller here: The effect is of order (md — m u )/(m u + m s ) ~ 
0.02, but this is still a small number. One might think that for the pions, where the 
square of the mass is proportional to m u + ma, the relative mass splitting should be 
large, of order (M^ — M%+)/M%+ oc (rrid — rriu)/(rnd + r~nu) — 0.3, in flat contradiction 
with observation. It so happens, however, that the pion matrix elements of the isospin 
breaking part of the Hamiltonian, ^(rn u —rrid)(uu — dd), vanish because the group SU(2) 
does not have a d— symbol. This implies that the mass difference between 7r° and 7r + is 
of second order in rrid — m u and therefore tiny. The observed mass difference is almost 
exclusively due to the electromagnetic self energy of the ir + . So, the above quark mass 
pattern is perfectly consistent with the fact that isospin is an almost exact symmetry 
of the strong interaction: The matrix elements of the term ^(m u —rrid)(uu — dd) are very 
small compared with those of H . In particular, the pions are protected from isospin 
breaking. 

QCD also explains another puzzle: Apparently, the mass splittings in the pseu- 
doscalar octet are in conflict with the claim that SU(3) represents a decent approximate 
symmetry. This seems to require M\ ~ M%, while experimentally, M\ ~ 13M^. The 
first order mass formulae yield M\jMl = (m s +m)/(m u +md), where rh = ^(m u + m d ) 
is the mean mass of u and d. The kaons are much heavier than the pions, because 
it so happens that m s is much larger than fh. For SU(3) to be a decent approximate 
symmetry, it is not necessary that the difference m s — m is small with respect to the 
sum m s + rh, because the latter does not represent the relevant mass scale to compare 
the symmetry breaking with. If the quark masses were of the same order of magnitude 
as the electron mass, SU(3) would be an essentially perfect symmetry of QCD; even in 
that world m s ^>m implies that the ratio M^/M^ strongly differs from 1. The strength 
of SU(3) breaking does not manifest itself in the mass ratios of the pseudoscalars, but 
in the symmetry relations between the matrix elements of the operators uu, dd, ss, 
which are used in the derivation of the above mass formulae. The asymmetries in 
these are analogous to the one seen in the matrix elements of the axial vector currents, 



Fk/Fk = 1.22, which represents an SU(3) breaking of typical size. The deviation from 
the lowest order mass formula, 



Mk m s + rh 



Ml m u + m d 



{1 + A M } , (3) 



is expected to be of the same order of magnitude, 1 + 

The Gell-Mann-Okubo formula yields a good check. The lowest order mass formula 
for the r\ reads 

Ml = \{m u + m d + 4m s )B + ... , (4) 

so that the mass relations for 71", K, r\ lead to 3M 2 + M 2 — 4Mj^ = 0. The accuracy within 
which this consequence of SU(3) symmetry holds is best seen by working out the quark 
mass ratio m s /m in two independent ways: While the mass formulae for K and it imply 
mjm = (2M|-M|) /M% = 25.9, those for 77 and vr yield m s /m = § (3M%-M%) /M% = 24.2. 
These numbers are nearly the same - the mass pattern of the pseudoscalar octet is a 
showcase for the claim that SU(3) represents a decent approximate symmetry of QCD, 
despite M\ ~ 13MJ. 



MASS FORMULAE TO SECOND ORDER 

The leading order mass formulae are subject to corrections arising from contribu- 
tions which are of second or higher order in the perturbation H\. A systematic method 
for the analysis of the higher order contributions is provided by the effective Lagrangian 
methodBi In this approach, the quark and gluon fields of QCD are replaced by a set of 
pseudoscalar fields describing the degrees of freedom of the Goldstone bosons it, K, r\. 
The effective Lagrangian only involves these fields and their derivatives, but contains 
an infinite string of vertices. For the calculation of the pseudoscalar masses to a given 
order in the perturbation Hi, however, only a finite subset contributes. The term Am, 
which describes the SU(3) corrections in the ratio M^/M^ according to eq. involves 
the two effective coupling constants L 5 and L$, which occur in the derivative expansion 
of the effective Lagrangian at first non-leading order:! 

A M = 8(M "" M ' ) (2L 8 -L 5 )+xlogs . (5) 

7T 

The term x^°S s stands for the logarithms characteristic of chiral perturbation theory. 
They arise because the spectrum of the unperturbed Hamiltonian Hq contains massless 
particles - the perturbation H\ generates infra-red singularities. The coupling constant 
L 5 also determines the SU(3) asymmetry in the decay constants, 

The comparison of eqs. (0) and (^) confirms that the symmetry breaking effects in the 
decay constants and in the mass spectrum are of similar nature. The calculation! also 
reveals that the first order SU(3) correction in the mass ratio {M\ a —M\^)j(M\ — M'^) 
is the same as the one in M^/M^: 

M 2 K0 -M 2 K+ m d -m u 2 



M 2 K - Ml ' m s -m 



{l + A M + 0(m 2 )} . (7) 



Hence, the first order corrections drop out in the double ratio 

n2 = Ml Ml- Ml 

The observed values of the meson masses thus provide a tight constraint on one par- 
ticular ratio of quark masses: 

Q 2 = 4^{l + 0(m 2 )} • (9) 

The constraint may be visualized by plotting the ratio m s /md versus m u /md$ Dropping 
the higher order contributions, the resulting curve takes the form of an ellipse: 

^) + ^U) =1 < (10) 

with Q as major semi-axis (the term m 2 /m 2 has been discarded, as it is numerically 
very small). 



VALUE OF Q 

The meson masses occurring in the double ratio (§) refer to pure QCD. The Dashen 
theorem states that in the chiral limit, the electromagnetic contributions to M|- + and to 
Ml+ are the same, while the self energies of K° and tt° vanish.! Since the contribution 
to the mass difference between 7r° and ir + from m d —m u is negligibly small, the masses 
in pure QCD are approximately given by 

(M 2 +) QCD = (M 2 o) QCD = M 2 o ^ 

(M 2 +) QCD = M £ + _ M 2 ++M 2 o j (M 2_ o) QCD = ^ 

where M n o , M 7r + , M K o , M K + are the observed masses. Correcting for the electromag- 
netic self energies in this way, the lowest order formulae becomea 

m u _ M 2 K+ - M 2 K0 + 2Ml - Ml + _ 

r2 _ rT72 -U.00 , [LL) 



m d M 2 R0 - M 2 K+ + Ml + 

m s ^ Mfa + M 2 K+ - Ml + 
m d ~ M 2 - M 2 + M 2 



20.1 



L K° 1V1 K^ 

The corresponding expression for the semi-axis Q reads 

2 _ (M 2 K0 + M 2 K+ - M 2 n+ + Ml )(M 2 K0 + M 2 K+ - Ml + - M, 2 ) 

4Ml (M 2 K0 -M 2 K+ + Ml + -M 2 n0 ) ■ [LA) 

Numerically, this yields Qd = 24.2. The corresponding ellipse is shown in fig. 1 as a 
dash-dotted line. For this value of the semi-axis, the curve passes through the point 



specified by Weinberg's mass ratios, eq. (JTT 



The Dashen theorem is subject to corrections from higher order terms in the chiral 
expansion. As usual, there are two categories of contributions: loop graphs of order 
e 2 m and terms of the same order from the derivative expansion of the effective e.m. 
Lagrangian. The Clebsch-Gordan coefficients occurring in the loop graphs are known to 
be large, indicating that two-particle intermediate states generate sizeable corrections; 
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Figure 1. Elliptic constraint. The dot indicates Weinberg's mass ratios. The dash-dotted line 
represents the ellipse for the value Q = 24.2 of the semi-axis, obtained from the mass difference 
K° — K + with the Dashen theorem. The full line and the shaded region correspond to Q — 22.7 ±0.8, 
as required by the observed rate of the decay 77 — > ir + ir~ ir° . 

the corresponding chiral logarithms tend to increase the e.m. contribution to the kaon 
mass difference.i The numerical result depends on the scale used when evaluating the 
logarithms. In fact, taken by themselves, chiral logs are unsafe at any scale - one at the 
same time also needs to consider the contributions from the terms of order e 2 m occurring 
in the effective Lagrangian. This is done in several recent papers, but the results are 
controversial. Donoghue, Holstein and Wyle estimate the contributions arising; from 
vector meson exchange and conclude that these give rise to large corrections, increasing 
the value (M K + —M K o) em ^ = 1.3 MeV predicted by Dashen to 2.3 MeV. According to 
refs.0 however, the model used is in conflict with chiral symmetry: Although the 
perturbations due to vector meson exchange are enhanced by a relatively small energy 
denominator, chiral symmetry prevents them from being large. In view of this, it is 
puzzling that Bijnens,H3 who evaluates the self energies within the model of Bardeen et 
al.,0 finds an even larger effect, (M K + — M K o) e rn ^ ~2.6 MeV. 

Recently, the electromagnetic self energies have been analyzed within lattice QCD.0 
The result of this calculation, (M^-+— M^-o) e m =1.9 MeV, indicates that the corrections 
to the Dashen theorem are indeed substantial, although not quite as large as found in 
refs. EM. The uncertainties of the lattice result are of the same type as those occuring 
in direct determinations of the quark masses with this method. The mass difference 
between K + and K°, however, is predominantly due to > m u , not to the e.m. 
interaction. An error in the self energy of 20% only affects the value of Q by about 
3%. The terms neglected when evaluating Q 2 with the meson masses are of order 
(Mf - M2) 2 /M 4 , where M is the mass scale relevant for the exchange of scalar or 
pseudoscalar states, M ~ M5 ~ M v i . The corresponding error in the result for Q is also 
of the order of 3% - the uncertainties in the value Q = 22.8 that follows from the lattice 
result are significantly smaller than those obtained for the quark masses with the same 
method. The implications of the above estimates for the value of Q are illustrated on 
the r.h.s. of fig. 2. 

The isospin-violating decay 77 — ^ 37r allows an entirely independent measurement 
of the semi-axisiil The transition amplitude is much less sensitive to the uncertainties 
associated with the electromagnetic interaction than the K°—K + mass difference: The 
e.m. contribution is suppressed by chiral symmetry and is negligibly small.0 The tran- 
sition amplitude thus represents a sensitive probe of the symmetry breaking generated 
by rrid — m u . To lowest order in the chiral expansion (current algebra), the amplitude 




Figure 2. The l.h.s. indicates the values of Q corresponding to the various experimental results for 
the rate of the decay r\ — > 7r + 7r~7r°. The r.h.s. shows the results for Q obtained with three different 
theoretical estimates for the electromagnetic self energy of the kaons. 



of the transition 7] — > 7t + tt tt° is given by 

V3 m d -m 1 j 
4 m s -rh F% K 3 w ' 

where s is the square of the centre-of-mass energy of the charged pion pair. The 
corrections of first non-leading order (chiral perturbation theory to one loop) are also 
known. It is convenient to write the decay rate in the form r r? ^ 7r + 7r - 7r o = T {Qd/Q) a , 
where Qo is specified in eq. (p!2|). As shown in ref.lll, the one-loop calculation yields a 
parameter free prediction for the constant T . Updating the value of F n , the numerical 
result reads T = 168 ± 50 eV. Although the calculation includes all corrections of 
first non-leading order, the error bar is large. The problem originates in the final 
state interaction, which strongly amplifies the transition probability in part of the 
Dalitz plot. The one-loop calculation does account for this phenomenon, but only 
to leading order in the low energy expansion. The final state interaction is analyzed 
more accurately in two recent papers, j3@ which exploit the fact that analyticity and 
unitarity determine the amplitude up to a few subtraction constants. For these, the 
corrections to the current algebra predictions are small, because they are barely affected 
by the final state interaction. Although the dispersive framework used in the two papers 
differs, the results are nearly the same: While Kambor, Wiesendanger and Wyler obtain 
To = 209 ± 20 eV, we get Tq = 219 ± 22 eV. This shows that the theoretical uncertainties 
of the dispersive calculation are small. Since the decay rate is proportional to Q~ A , the 
transition rj—>-37T represents an extremely sensitive probe, allowing a determination of 



Q to an accuracy of about l\ 



Unfortunately, however, the experimental situation is not clear.113 The value of 
T^k+k-kQ relies on the rate of the decay into two photons. The two different methods 
of measuring T^ 77 - photon-photon collisions and Primakoff effect - yield conflicting 
results. While the data based on the Primakoff effect are in perfect agreement with 
the number Q = 24.2 which follows from the Dashen theorem, the 77 data yield a 
significantly lower result (see l.h.s. of fig. 2). The statistics is dominated by the 77 
data. Using the overall fit of the Particle Data Group,0 T^^+^-^o = 283 ± 28 eV and 
adding errors quadratically, we obtain Q = 22.7 ± 0.8, to be compared with the value 
Q = 22.4 ± 0.9 given in ref.0. The result appears to confirm the lattice calculation.0 
The above discussion makes it clear that an improvement of the experimental situation 
concerning r ?) _ >77 is of considerable interest. 



A PHENOMENOLOGICAL AMBIGUITY 



Chiral perturbation theory thus fixes one of the two quark mass ratios in terms of 
the other, to within small uncertainties. The ratios themselves, i.e. the position on the 
ellipse, are a more subtle issue. Kaplan and ManoharB pointed out that the corrections 
to the lowest order result, eq. (0), cannot be determined on purely phenomenological 
grounds. They argued that these corrections might be large and that the w-quark might 
actually be massless. This possibility is widely discussed in the literature,!^ because 
the strong CP problem would then disappear. 

The reason why phenomenology alone does not allow us to determine the two indi- 
vidual ratios beyond leading order is the following. The matrix 

m' = aim + a2(m + )~ det m 

transforms in the same manner as m. For a real, diagonal mass matrix, the transfor- 
mation amounts to 

m' u = a.\m u + a 2 m d m s (cycl. u — > d — > s — > u) . (13) 

Symmetry does therefore not distinguish vnl from m. Since the effective theory exclu- 
sively exploits the symmetry properties of QCD, the above transformation of the quark 
mass matrix does not change the form of the effective Lagrangian - the transforma- 
tion may be absorbed in a suitable change of the effective coupling constants.! This 
implies, however, that the expressions obtained with this Lagrangian for the masses of 
the pseudoscalars, for the scattering amplitudes, as well as for the matrix elements of 
the vector and axial currents are invariant under the operation m — > m! . Conversely, 
the experimental information on these observables does not allow us to distinguish m' 
from m. 

Within the approximations used, we may equally well write the mass ratio that 
characterizes the ellipse in the form Q 2 = {m 2 — \m 2 l — \m 2 d )/{m 2 d — m 2 ). Up to terms 
of order m 4 , which are beyond the accuracy of our formulae, the differences of the 
squares of the quark masses are invariant under the transformation ([13]). Hence Q 2 is 
invariant, i.e. the ellipse is mapped onto itself. The position on the ellipse, however, 
does not remain invariant and can therefore not be determined on the basis of chiral 
perturbation theory alone. 

We are not dealing with a hidden symmetry of QCD here - this theory is not 
invariant under the change (p~3|) of the quark masses. In particular, the matrix elements 
of the scalar and pseudoscalar operators are modified. The Ward identity for the 
axial current implies, for example, that the vacuum-to-pion matrix element of the 
pseudoscalar density is given by 

(0\d i'y 5 u\n + ) = V2F 1T Ml + /(m u + m d ) . (14) 

The relation is exact, except for electroweak corrections. It involves the physical quark 
masses and is not invariant under the above transformation. Unfortunately, however, 
an experimental probe sensitive to the scalar or pseudoscalar currents is not available 
- the electromagnetic and weak interactions happen to probe the low energy structure 
of the system exclusively through vector and axial currents. 



ESTIMATES AND BOUNDS 



I now discuss the size of the corrections to the leading order formulae ([Tl]) for the 
two quark mass ratios m u /md and m s /md. For the reasons just described, this discus- 
sion necessarily involves a theoretical input of one sort or another. To clearly identify 
the relevant ingredient, I explicitly formulate it as hypothesis A, B, . . . 

Hypothesis A: Assume that the corrections of order m 2 or higher are small 
and neglect these. 

This is the attitude taken in early work on the problem.0 In the notation used above, 
the assumption amounts to A A / ~ 0, so that m s /rh ~ (2M|- — M%)/M% ~ 26. In the 
plane spanned by m u /m^ and m s /m^, this represents a straight line. The intersection 
with the ellipse then fixes things. It is convenient to parametrize the position on the 
ellipse by means of the ratio R, which measures the relative size of isospin and SU(3) 
breaking, 

m s — rh , 

R = — . 15 

m d - m u 

With the value Q = 24.2 (Dashen theorem), the intersection occurs at the mass ratios 
given by Weinberg, which correspond to i?~43. For the value of the semi-axis which 
follows from 77 decay, Q = 22.7, the intersection instead takes place at _R~39. 

The baryon octet offers a good test: Applying the hypothesis to the chiral expansion 
of the baryon masses, i.e. disregarding terms of order m 2 , one arrives at three inde- 
pendent estimates for R, viz. 51 ± 10 (N-P), 43 ± 4 (E~-£ + ) and 42 ± 6 (S _ -S°).Q 
Within the errors, these results are consistent with the values R ~ 43 and 39, obtained 
above from K° — K + and from 77 — > 7r + 7r~7r°, respectively. A recent reanalysis of p — uo 
mixing^ leads to R = 41 ± 4 and thus corroborates the picture further. 

Another source of information concerning the ratio of isospin and SU(3) breaking ef- 
fects is the branching ratio V^^^o/Y^i^^, The chiral expansion of the corresponding 
ratio of transition amplitudes starts witha 

(V>7r | qmq\ip') 3y3 
(ipr]\qmq\ijj') 4R 



{1 + A^ + ...} 



Disregarding the correction A^/, which is of order m s — rh, the data imply i? = 31 ±4, 
where the error bar corresponds to the experimental accuracy of the branching ra- 
tio. The value is significantly lower than those listed above. The higher order cor- 
rections are discussed in ref.E3, but the validity of the multipole expansion used there 
is questionable.^ The calculation is of interest, because it is independent of other 
determinations, but at the present level of theoretical understanding, it is subject to 
considerable uncertainties. Since the quark mass ratios given in refsHj rely on the value 
of R obtained in this way, they are subject to the same reservations. Nevertheless, the 
information extracted from ip' decays is useful, because it puts an upper limit on the 
value of R. As an SU(3) breaking effect, the correction A^i is expected to be of order 
25%. The estimate |A^/| < 0.4 is on the conservative side. Expressed in terms of R, 
this implies R < 44. 



3 

*Note that, in this case, the expansion contains terms of order , which do play a significant role 
numerically. The error bars represent simple rule-of-thumb estimates, indicated by the noise visible 
in the calculation. For details see ref.B. 



Hypothesis B: Assume that the effective coupling constants are dominated 
by the singularities which are closest to the origin. 



This amounts to a generalization of the vector meson dominance hypothesis and 
yields rough estimates for the various coupling constants, for instance,^ 

F 2 F 2 F 2 

L* * WTT, , • • ■ (16) 



° 4M„ 2 ' ' 48 M 2 , ' 3 2M? 

In all cases where direct phenomenological information is available, these estimates do 
remarkably well. Also, this framework explains why it is justified to treat m s as a 
perturbatiomil at order p 4 , the symmetry breaking part of the effective Lagrangian is 
determined by the constants L4, . . . , L$- These are immune to the low energy singular- 
ities generated by spin 1 resonances, but do receive contributions from the exchange of 
scalar or pseudoscalar particles. Assuming that the pole terms generated by the light- 
est such particles dominate, the magnitude of these coupling constants is determined 
by the scale M ao ~ M v / ~ 1 GeV. According to eq. the asymmetry in the decay 
constants, for instance, is given by 

£ . 1 + M_M + , logs . (17) 

This shows that the breaking of the chiral and eightfold way symmetries is controlled 
by the mass ratio of the Goldstone bosons to the non-Goldstone states of spin zero, 
M\jM 2 ^ ~ M\jM 2 , — \- In chiral perturbation theory, the observation that the 
Goldstones are the lightest hadrons thus acquires quantitative significance. 

Since the coupling constant L 5 and the combination L 5 — 12L 7 — 6L 8 are known 
experimentally (from F K /F n and 3M 2 + M 2 — 4M|-, respectively), we may express A M 
in terms of known quantities, except for a contribution from L 7 . Inserting the estimate 
( |i~6"|) for this constant, we obtain a small, negative number@ A^ ~ —0.16. Unfortu- 
nately, the result is rather sensitive to the uncertainties of the saturation hypothesis, 
because the contributions from Lj and from the remainder are of opposite sign and 
thus partly cancel. This is illustrated by the following observation. The constant Lj 
enters through its contribution to the mass of the 77. When replacing the term with 
the one from ?/-exchange, the corresponding energy denominator, (M3 — p 2 )~ 1 , should 
be evaluated at p 2 = M 2 . In the formula flUD , the denominator is replaced by the 
corresponding leading order term, (M^,) _1 . The neglected higher order effects are not 
exceedingly large, but they reduce the numerical value of the prediction for A M by a 
factor of 2. 

The breaking of SU(3l induces rj — rj' mixing. When analyzing this effect within 
chiral perturbation theory J we noticed that the observed value of M v requires a mixing 
angle that is about twice as large as the canonical value l^/J ~ 10° accepted at that 
time. The conclusion was confirmed experimentally soon thereafter.!! We may now 
turn the argument around,0 use the phenomenology of the mixing angle to estimate 
the magnitude of Lj and then determine the size of Am- For a mixing angle in the range 
20° < 6, n ,, n < 25°, this leads to -0.06 < A M < 0.09. In this calculation, the energy 
denominator is evaluated at the proper momentum, but the uncertainties arising from 
the cancellation of two contributions remain. 

Quite irrespective of these uncertainties, the result for Am is a very small number: 
The hypothesis that the low energy constant L-j is dominated by the singularity due 
to the 7/ implies that the corrections to the lowest order mass formula for M\jM 2 are 
small. In view of the elliptic constraint, this amounts to the statement that A follows 
from B. 



Hypothesis C: Assume that the large-iV c expansion makes sense for iV c = 3. 



As noted already in refBa, the ambiguity discussed above disappears in the large- 
N c limit, because the Kaplan-Manohar transformation violates the Zweig rule. In this 
limit, the structure of the effective theory is modified, because the U(l)-anomaly is 
then suppressed, so that the spectrum contains a ninth Goldstone boson, which is 
identified with the r/.0 The implications for the effective Lagrangian are extensively 
discussed in the literature and the leading terms in the expansion in powers of 1/N C 
are well-known.EH More recently, the analysis was extended to first non-leading order, 
accounting for all terms which are suppressed either by one power of 1/N C or by one 
power of the quark mass matrix.^ This framework leads to a bound for Am, which 
arises as follows. 

At leading order of the chiral expansion, the mass of the rj is given by the Gell- 
Mann-Okubo formula. At the next order of the expansion, there are two categories 
of corrections: (i) The first is of the same origin as the one which occurs in the mass 
formula (|3|) for the ratio M\jMl and is also determined by Am- The expression for the 
mass of the rj, which follows from the Gell-Mann-Okubo formula, M 2 = |(4M|- — M%), 
is replaced by 

m \ = \{AMl - Ml) + f (M£ - Ml) A M . 

(ii) In addition, there is mixing between the two states rj, rj'. The levels repel in propor- 
tion to the square of the transition matrix element U\ oc (i]'\qmq\r]), so that the mass 
formula for the r] takes the form 

M ^ m ?-5F^T (18) 

This immediately implies the inequality M^<m\, i.e. 

4M| - 3M„ 2 - Ml 

> ~ mi-Mi)' - ~°- m • 

At leading order of the expansion, the transition matrix element 0\ is given by 
o"o = \ > \[2{M\ — Ml). There are again two corrections of first non-leading order: 
ci = Co (1 + Am — An)- The first is an SU(3) breaking effect of order m s — m, 
determined by Am, while An represents a correction of order 1/N C of unknown size 
- the mass formula (JlSf) merely fixes An as a function of Am or vice versa: As Am 
grows, A N decreases. A coherent picture, however, only results if both \A M \ and |Ajv| 
are small compared with unity. If the above inequality were saturated, <j\ would have 
to vanish, i.e. 1 + A^r — Am = 0. In other words, the corrections would have to cancel 
the leading term. It is clear that, in such a situation, the expansion is out of control. 
Accordingly, Am must be somewhat larger than —0.07. Even Am = calls for large 
Zweig rule violations, A N — §■ The condition 

Am>0 (19) 

thus represents a generous lower bound for the region where a truncated 1/N C expansion 
leads to meaningful results. It states that the current algebra formula, which relates 
the quark mass ratio m s /m to the meson mass ratio M|-/M 2 , represents an upper limit, 
m s /m<2M 2 K /Ml - 1 = 25.9. 

This shows that A, B and C are mutually consistent, provided Am is small and 
positive. The bound (|T9|) is shown in fig. 3: Mass ratios in the hatched region are 



in conflict with the hypothesis that the first two terms of the 1/N C expansion yield 
meaningful results for N c = 3. Since the Weinberg ratios correspond to Am = 0, they 
are located at the boundary of this region. In view of the elliptic constraint, the bound 
in particular implies m u /m d > |. 

Hypothesis D: Assume that m u vanishes. 

It is clear that this assumption violates the large-iV c bound just discussed. D is also 
inconsistent with A and B. In fact, as pointed out in refs.ll, this hypothesis leads to a 
very queer picture, for the following reason. 

The lowest order mass formulae (|]) and ([|) imply that the ratio m u / m d determines 
the K°/K + mass difference, the scale being set by 

M 2 K0 -M 2 K+ = md ~ mu M 2 n + ... 
m u + m d 

The formula holds up to corrections from higher order terms in the chiral expansion and 
up to e.m. contributions. Setting m u = 0, the relation predicts M K o — M K + ~ 16MeV, 
four times larger than the observed mass difference. The disaster can only be blamed 
on the higher order terms, because the electromagnetic self energies are much too small. 
Under such circumstances, it does not make sense to truncate the expansion at first 
non-leading order. The conclusion to be drawn from the assumption m u = is that 
chiral perturbation theory is unable to account for the masses of the Goldstone bosons. 
It is difficult to understand how a framework with a basic flaw like this can be so 
successful. 

The assumption m u = also implies that the matrix elements of the scalar and 
pseudoscalar currents must exhibit very strong SU(3) breaking effects. @ Consider e.g. 
the pion and kaon matrix elements of the scalar operators uu,dd,ss. In the limit 
md = m s , the ratio 



(7T+ 


j uu — ss\ 


7T + ) 


(K+\ 


uu — dd\K + ) 



is equal to 1. The SU(3) breaking effects are readily calculated by working out the 
derivatives of M 2 + ,M^ + with respect to m u ,m d ,m s . Neglecting the chiral logarithms 
which turn out to be small in this case, the first order corrections may be expressed in 
terms of the masses, 



m s -m u M| - M 2 



m d -m u M 2 RQ - M 2 K+ 



1 + 0(m 2 



The relation is of the same character as the one that leads to the elliptic constraint: The 
corrections are of second order in the quark masses. For m u = 0, this constraint reduces 
to m s /m d = Q + |, so that the relation predicts r ~ 3, the precise value depending on 
the number used for the electromagnetic contribution to — M^o. So, m u = leads 
to the prediction that the evaluation of the above matrix elements with sum rule or 
lattice techniques will reveal extraordinarily strong flavour symmetry breaking effects 
- a bizarre picture. For me this is enough to stop talking about m u = here. 



MAGNITUDE OF m s 



Finally, I briefly comment on the absolute magnitudes of the quark masses. The 
effective low energy theory does not allow us to determine these phenomenologically, 




Figure 3. Quark mass ratios. The dot corresponds to Weinberg's values, while the cross represents 
the estimates given in veiB. The hatched region is excluded by the bound Am > 0. The error ellipse 
shown is characterized by the constraints Q = 22.7 ± 0.8, Am > 0, R < 44, which are indicated by 
dashed lines. 

because the low energy constant B cannot be measured directly. Since the ratios 
m u : ma '■ m s are known to within small uncertainties, it suffices to discuss the available 
information in terms of one of the three masses - I consider the value of m s . 

The best determinations of the absolute magnitude of m u , md and m s rely on QCD 
sum rules." A detailed discussion of the method in application to the mass spectrum 
of the quarks was given in 1982.1 Fixing the running scale at fj, — 1 GeV, the result 
quoted for the MS running mass of the strange quark is m s = 175 ± 55 MeV. The issue 
has been investigated in considerable detail since then.0 The value given in the most 
recent paper,il 

m s = 175 ±25 MeV , (20) 

summarizes the state of the art: The central value is confirmed and the error bar 
is reduced by about a factor of two. The residual uncertainty mainly reflects the 
systematic errors of the method, which it is difficult to narrow down further. 

There is considerable progress in the numerical simulation of QCD on a latticed 
For gluodynamics and bound states of heavy quarks, this approach already yields sig- 
nificant results. The values obtained for m s are somewhat smaller than the one given 
above. The APE collaboration,!^ for instance, reports m s = 128 ± 18 MeV for the 
MS running mass at \i — 2 GeV. It is difficult, however, to properly account for the 
vacuum fluctuations generated by quarks with small masses. Further progress with 
light dynamical fermions is required before the numbers obtained for m u , or m s can 
be taken at face value. In the long run, however, this method will allow an accurate 
determination of all of the quark masses. 



CONCLUSION 

Light quark effective theory represents a coherent theoretical framework for the 
analysis of the low energy structure of QCD. The method has been applied to quite a few 
matrix elements of physical interest and the predictions so obtained have survived the 
experimental tests performed until now. In these lectures, I focussed on the implications 



for the ratios of the light quark masses, where the method leads to the following results: 
1. The ratios m B /mj and m s /m d are constrained to an ellipse, whose small semi- 
axis is equal to 1, 

'm u \ 2 1 /m s \ 2 
,m d J Q 2 \m d ) 
r) decay yields a remarkably precise measurement of the large semi-axis, 

Q = 22.7 ±0.8 . 

Unfortunately, however, the experimental situation concerning the lifetime of the 77 is 
not satisfactory - the given error bar relies on the averaging procedure used by the 
Particle Data Group. 

2. The position on the ellipse cannot accurately be determined from phenomenology 
alone. The theoretical arguments given imply that the corrections to Weinberg's leading 
order mass formulae are small. In particular, there is a new bound based on the 1/N C 
expansion, which requires m u /m d > ^ and thereby eliminates the possibility that the 
M-quark is massless. 

3. The final result for the quark mass ratios is indicated by the shaded error ellipse in 
fig. 3, which is defined by the following three constraints: (i) On the upper and lower 
sides, the ellipse is bounded by the two dashed lines that correspond to Q — 22.7 ±0.8. 
(ii) To the left, it touches the hatched region, excluded by the large- N c bound, (iii) On 
the right, I use the upper limit R < 44, which follows from the observed value of the 
branching ratio r^i^ n o/T^i^ v . The corresponding range of the various parameters 
of interest isEll 

TTl TTl TTl 

— = 0.553 ± 0.043 , — = 18.9 ±0.8 , — = 34.4 ±3.7, 

m d m d m u 

m R — rh . _ _ m s 



40.8 ± 3.2 , = 24.4 ± 1.5 , A M = 0.065 ± 0.065 . 



While the central value for m u j m d happens to coincide with the leading order formula, 
the one for m s /m d turns out to be slightly smaller. The difference, which amounts to 
6%, originates in the fact that the available data on the rj lifetime imply a somewhat 
smaller value of Q than what is predicted by the Dashen theorem. 

4. The theoretical arguments discussed as hypotheses A, B and C are perfectly con- 
sistent with these numbers. In particular, the early determinations of R, based on 
the baryon mass splittings and on p — u mixing,0 are confirmed. The rough estimate 
m s /m = 29 ±7, obtained by Bijnens, Prades and de Rafael from QCD sum rules,@ pro- 
vides an independent check: The lower end of this interval corresponds to Am < 0.17. 
Fig. 3 shows that this constraint restricts the allowed region to the right and is only 
slightly weaker than the condition R < 44 used above. 

5. The mass of the strange quark is known quite accurately from QCD sum rules: 

m s = 175 ± 25 MeV (MS scheme at p. = 1 GeV) . 
Using this value, the above ratios then determine the size of m u and m d : 



m u = 5.1 ± 0.9 MeV , m d = 9.3 ± 1.4 MeV 
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